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Abstract 

The paper deals with variational approaches to the segmentation 
of time series into smooth pieces, but allowing for sharp breaks. In dis- 
crete time, the corresponding functionals are of Blake-Zisserman type. 
Their natural counterpart in continuous time are the Mumford-Shah 
functionals. Time series which minimise these functionals are proper 
estimates or representations of the signals behind recorded data. We 
focus on consistent behaviour of the functionals and the estimates, as 
parameters vary or as the sampling rate increases. 

For each time continuous time series / G L'^{[0, 1]) we take condi- 
tional expectations w.r.t. to cr-algebras generated by finer and finer 
partitions of the time domain into intervals, and thereby construct a 
sequence (/n)neN of discrete time series. As n increases this amounts 
to sampling the continuous time series with more and more accuracy. 

Our main result is consistent behaviour of segmentations w.r.t. to 
variation of parameters and increasing sampling rate. 

Keywords: Segmentation, Blake - Zisserman and Mumford - Shah func- 
tional, F— Convergence, Hausdorff - metric 

MSG: 49J45, 49J52, 93E14 
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1 Introduction 

We will first introduce and discuss the adopted concept of segmentation, and 
define the functionals we are dealing with. Then the main result will be 
stated, and finally, we sketch the plan of the paper. 

1.1 Segmentations 

A fundamental variational ansatz for the segmentation of time series is the 
minimisation of functionals which penalise undesired properties of the esti- 
mate against fidelity to the data. The latter will be measured by the L^- 
distance of the estimate and data. The penalty should depend on 

(i) the measure of the set of 'jumps' or 'breaks' between regions of 'smooth- 



(ii) a notion of smoothness which restricts the behaviour of the signal be- 
tween two subsequent jumps. 

The notion of 'jump' and 'smoothness' will be made precise shortly. Let us 
illustrate the concepts by way of two examples. Suppose that the segmenta- 
tion is a square integrable function g on the closed unit interval U = [0, 1]. 
As segmentations, we allow functions f = t + F with a right continuous step 
function t and a Sobolev differentiable function F. These functions are of 
special hounded variation, see Section 12.41 The Mumford-Shah functionals in 
time dimension one are defined as 



where j{t) is the number of discontinuities of t in (0, 1), /' is the Sobolev 
derivative of F, and the parameters 7 and n control the number of breaks 
and the degree of smoothness. This is the one- dimensional version of the 
functionals introduced in and [ISj for space dimension two. 

In this model, jumps are the discontinuities oft which by Sobolev 's embed- 
ding theorem can be identified with the discontinuities of /. 'Smoothness' is 
measured by the L^-norm of the derivative of F within the intervals between 
subsequent jumps; it coincides there with the derivative of /. Fidelity to the 
data, finally, is measured by the L^-distance of the segmentation to data. 



ness , 
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The second example - which is in fact a special case of the Mumford 

- Shah functional if /i = - is called Potts - functional, inspired by the 
functional introduced in ^1] as a generalization of the Ising model (0) in 
statistical mechanics. It is defined for step functions t and given by 

P^,,(t) ■.= jj{t) + \\t-gf. 

The notions of jump and of fidelity to the data is the same as for the Mumford 

- Shah functional, whereas the notion of smoothness is considerably stronger, 
only functions which are even constant on the intervals between consecutive 
jumps are possible outcomes of the segmentation procedure. The Mumford - 
Shah functional (and hence the Potts-functional as well) can be extended to 
L^(f/) in the sense that it is given by the expression above, if the equivalence 
class of / G L'^{U) contains a function which can be written as f = t + F, 
and that it is equal to oo, if not (cf. Definition [TUI). 

In contrast to the continuous setting, there are no 'obvious' notions of 
smoothness or jumps for discrete time - series g = (g^, ...,g"'~^) G M". One 
possibility is to consider those points k G {0,...,n — 2} as jumps, where 
the difference If^"^^ — f^l exceeds a given threshold (cf. Definition IH)) • One 
particular functional, where this notion is immanent is the Blake - Zisserman 
functional (see jS], For 7, /i > it is given by 

n— 2 n—1 

Bz,,,,,(/) := 5^min{ir+i - rw + ^^(r - g""?- 

K=Q K = 

An equivalent definition (cf. Lemma EI) focussing more closely on the nature 
of jumps (edges') in this functional by showing that the min' in the penalizing 
term arises from a minimization with respect to a larger space of covariables 
including explicit information about jumps', was given in [F. As for the 
Mumford - Shah functional, the special case /U = 0, i.e. 

n-l 

is called (discrete) Potts model. It also favors segmentations / which are 
constant between consecutive jump points. 

In the variational ansatz, we consider as segmentations those functions 
or vectors, respectively, which minimize the given functionals. That imme- 
diately leads to the following questions: 
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(i) Is there always a minimizer of the given functional ? 

(ii) How does it depend on the (free) parameters 7 and /i ? 

In view of the discussion about jumps and smoothness above, we may also 
ask a question about the relation between the discrete and the continuous 
setup: 

(iii) Is there an embedding of the discrete situation into the continuous one, 
i.e. a way of discrete sampling from a continuous 'truth', such that the 
discrete segmentations converge to the segmentations of the continuous 
signal ? 

One exact formulation of question (iii) and an affirmative answer about the 
relation of Mumford - Shah and Blake - Zisserman functional is the main 
subject of this paper and is provided by Theorem [T] in the next subsection. 
As a byproduct of our analysis, we obtain in that case answers to questions 
(i) and (ii) as well. 

1.2 The Main Result 

For an exact formulation of the third question, we have to specify the embed- 
ding of the discrete into the continuous situation. Since we are considering 
signals g G L'^{U), it is not suitable to consider vectors arising from the 
evaluation of a signal g at several distinct time points. Instead, we con- 
sider its conditional expectation with respect to the a- algebra generated by 
a fixed partition of U = [0, 1]. The intuition behind this procedure is, that 
the output of our measuring device is truly an average of the signal over a 
short period of time. Sampling with more and more accuracy thus means to 
decrease the length of these periods. We adopt the following conventions: 

Definition 1 Let n E N and the equidistant setup be given by 

Sin) :={^ ■.K = 0,...,n}cU 

and an denote the a-algebra 

a„:=a([0,i),[i,i),...,[i^,i^),[^,l]). 

The conditional expectation of a signal g G L'^{U) with respect to an will be 
shortly denoted by gn := E{g \ an)- 
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Remark. The fact that the rightmost interval is closed has no further signif- 
icance. We found that it is most convenient to deal with the right boundary 
point in the way that we consider only step functions which are continuous 
at s = 1. 



As an appropriate discrete input for the Blake - Zisserman functional we 
consider now the signal g = 7r„gi := {g'^, ...,gl^~^) e given by 



K + l 

9n ■= E{g I (T„)(^) =n j^" g{s)ds. 



A one - sided inverse to this discretization map is provided for / G M" by the 
step function 



n-2 



K=0 

Remcirk. Note that for the conditional expectation of a signal g E L'^{U), 
we have gn — E{g \ an) = t„ o Tr^g^. This fact will be used frequently in the 
sequel. 

As a consequence of our treatment of the boundary point s = 1 - which 
implies that all (j„-measurable functions are right- continuous step functions 
with left limits which are additionally continuous at s = 1 - we will assume 
this property as well for all step functions considered in this paper. Hence 
we arrive at the following definition. 

Definition 2 (1) We denote by T{U) the set of all right- continuous step 

functions on U with left limits which are additionally continuous at 1 E U. 
(2) We denote by Tn{U) C T{U) the set of (Tn-measurable functions on U . 

With these conventions in mind, we can now define an embedded version 
of the Blake - Zisserman functional and of the discrete Potts - Functional on 
L\U). Let := rj = (f, f^'^) e M" given by ^ := /(s) if / e %{U) 
and g given loy the conditional expectation above. Then 



Ryn fr._(^'Z'^,n/n,g/M^nf/Vri) H f & %{U) 

BZ,,^,,(/) - I ^ ^^^^ 
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and 



7,9, 
OO 



J/v^) 



if/GT„(f/) 
else 



respectively (cf. Definition II ip. 

To be more precise, we embed the functionals defined above into a family 
of functionals depending on two real and one additional rational parameter 
which refiects the transition from discrete to continuous. Let 

T := {1/n : n G N} U {0} C M 

be equipped with its relative topology as a subset of M, i.e. is the only 
accumulation point. Consider the (pseudo-) cube Q = Mq x Mq x T. To 
each point of the cube corresponds one functional as follows: 
Let (7,/i, t) G Q and g G L'^iJJ). We adopt the convention that t = 1/n, 
= 1/00, respectively. Then, the functions F{pf,iJi,t) : L'^iU) M are given 

by 

MS, 
P! 



F(7,/i,t) := 



■'7.M.9 



7.9 
7.9 



d" 
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7 > 0,/i > 0,t > 
7 > 0,/i > 0,t = 
7 > 0,/i = 0,t > 
7 > 0,/i = 0,t = 
7 = 0,/i > 0,t > 
7 = 0,/i > 0,t = 



Here BZ 



7:/i:9' 



MS 



7.M.9 



are the Mumford- Shah- and Blake- Zisserman func- 
tionals which were shortly discussed above and which precise form extended 
to functionals on L'^{U) is given in the Definitions ^1 below. The 
Potts- functionals P" ^ and P^^g above, the discrete distance functional 

d" : L2(f/) M is given by 



ci:(/) : = 



iTVoU - 9n)\-) fe%{U) 

else 



00 



and finally the continuous distance functional dg : L'^{U) — > M given by 

daU) ■■= I U-9?ds. 
Ju 

With these definitions, the main result of this paper reads as follows: 

Theorem 1 For the family F{q) : L'^{U) —>■ W,q E Q, the following state- 
ments hold: 
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(i) For all q & Q there is a minimizer of F{q). 

(a) Let Qs = (7s,/is,ts), s G N converge to q = (7,/i,t) in Q as s tends to 
infinity. Denote by f^sm ^ sequence of minimizer s of F{qs). Then: 

(a) Every convergent subsequence 0/ converges to a minimizer 
ofF{q). 

(b) fssf^N o-lways contains a convergent subsequence. 

That means, segmentation of a time series using the variational ansatz 
with these functionals behaves consistently under variation of the parameters 
and under sampling the true signal with more and more accuracy. 

1.3 Plan of the Paper 

The proof of our main result is mainly based on the fact, that the minimi- 
sation of the functionals under consideration can be splitted up: Once a set 
of jumps is fixed, there is a minimizer of the functional among all admissible 
functions whose jump set is contained in the given one. This minimizer can 
be computed rather explicitly. The minimization of the functional over all 
possible segmentations reduces thus to the minimization with respect to all 
jump sets, which will be identified with partitions of U or, respectively, its 
associated minimizers. In the sequel, this common feature of the functionals 
will be called the reduction principle. The minimizers associated to a fixed 
jump set will be called partition solvers. 

In the following section, we begin by fixing the notion of partition which 
is basically the exact manifestation of jump set', together with some of its 
properties as finite sets. Furthermore, we make precise our point of view by 
defining what we understand as a segmentation. As its most important mani- 
festation in this paper, we consider functions of special bounded variation. In 
Sectional we give an exact statement of the reduction principle and compute 
the partition solvers. Continuity properties of the latter ones are proved in 
the subsequent section. The first important implication of these considera- 
tions is relative compactness of the set of minimizers derived in Section El 
As explained at the beginning of Section IHl F-convergence together with this 
compactness result implies Theorem ^ To establish the F-convergence result 
Theorem in the final section, the basic ingredient is an associated result of 
independent interest, given by the Lemmas IT51 and 1141 on the interchange- 
ability of Sobolev differentiation and approximation by step functions. This 
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is proved in the remainder of Section [Ul We end up with a (then) short proof 
of our main result Theorem ^ 

2 Segmentations and Partitions 

In this section, we introduce what we mean by the segmentation of a signal 
which depends on one parameter, for instance a discrete or continuous time- 
series. 

2.1 The Continuous Case 

In that case, the signal depends on a continuous set of parameters. We 
restrict ourselves to square integrable signals. 

Definition 3 Let U := [0, 1] and L'^{U) denote the Hilbert space of (equiv- 
alence classes) of square integrable functions on U with respect to Lebesgue 
measure. A signal is a function g G L'^{U). A segmentation class on U is 
a class S{U) of (equivalence classes) of right- continuous functions with left 
limits, i.e. there is an infective map S(f/) C D(f/) where D(f/) denotes the 
Skorohod space (see e.g. J^). 

We have the following examples for segmentation classes: 

Example.(l) S{U) = D{U). (2) S{U) = T{U), the space of all right- 
continuous step functions with left limits which are even continuous at 1 G ?7. 
(3) S{U) = SBV2(t/), the functions of 2-bounded special variation discussed 
below. 

Intuitively, a segmentation provides a decomposition of a signal into ho- 
mogeneous parts which are separated by abrupt changes {jumps). The de- 
composition idea is represented by the concept of partitions. For reasons of 
technical convenience, we decided that all partitions contain the boundary 
points of /. 

Definition 4 Let \M\ denote the cardinality of a subset M G U. (1) The 
partitions of the interval U are given by the set 

V{U) := {p C [0, 1] : \p\ < oo; 0, 1 G p}. 



10 



(2) The special closed subsets of the interval I are given by the set 

C{U) := {c C [0, 1] : c closed; 0, 1 G c}. 

(3) The partition p{f) associated to a given f G D(f/) with finite set of 
discontinuities disc(/) is p{f) := disc(/) U {0, 1}. 

In the sequel, the points in disc(/) are frequently called jumps. 
2.2 The Discrete Case 

In the discrete case, we still think of a continuous signal in the sense defined 
above. In contrast to the continuous case, we can only sample its values at 
finitely many (equidistant) time points. Hence, partitions into continuous 
parts separated by discontinuities make no longer sense and we have to sub- 
stitute it by something else. Recall the definitions [T] and El of the equidistant 
setup and the step functions associated to it. 

Definition 5 By 

Vn{U) ■.= {peV{U) : p(lS{n)] 
we denote the partitions compatible with the equidistant setup S{n). 

So far, we did not say anything about a decomposition of the signal into 
more and less homogeneous parts. In the continuous case, this was achieved 
with the help of a partition associated to the segmentation function. We will 
do the same now by introducing a suitable threshold. 

Definition 6 A threshold is a function T : S{n) M.^ . The T-partition of 

a function f G Tn(U) is given by 

PTif) ■■= {f : |/(^) - /(f )| > TiK); K = 1, n - 1} U {0, 1} G P„(f/). 
Points in prif) o'^e called discrete discontinuities of f (with respect to T). 

In that sense, we consider those points as points of discontinuity where 
the difference of values at adjacent sampling points exceeds a given threshold. 
Note that the threshold may as well be adapted, i.e. depend on the function 
under consideration. 
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Example. (1) For T = 0, all points where /(■^) 7^ /(f) are discrete dis- 
continuities. This choice corresponds to the consideration of the minimizers 
of the Potts functional, which are constant off the jump set. (2) For the 
Blake-Zisserman functional to parameters 7,/^, n (see below), the appropri- 
ate choice is T{k) = /i >/^yJn. 

2.3 Partitions and HausdorfF Metric 

A topology on the set of closed subsets of U is provided by the Hausdorff 
metric. 

Definition 7 Let c, c' be closed non-empty subsets of U. Then the distance 
dnic, c') := max < maxmin \x — maxmin |x — ?/| > 

is called Hausdorff distance. 

The properties of the HausdorfF distance are summarized by the following 
proposition, the proof follows from standard facts, available for instance in 
in or [Hi- 
Proposition 1 (1) The closed subsets of U, provided with Hausdorff dis- 
tance, form a compact (hence complete) metric space. (2) The set C{U) is 
a closed subspace, hence as well compact. (3) The subset V{U) C C{U) is 
dense, i.e. V{U) = C{U). (4) The subset VniU) C C{U) is finite, hence as 
well closed and compact. 

In the sequel, we will need another characterization of Hausdorff convergence 
of partitions focussing on the decomposition of U into intervals. 

Definition 8 Let p G V{U). By p'^ := U — p we denote the complement of 
p in U. The complement of p = {0 = Xq < Xi < ... < = 1} zs a disjoint 
union of open subintervals Q G U. The collection of these subintervals will 
be denoted by 

l{p) := {6 C / : 6 = {xk, Xk+i) ■ /c = 0, m - 1}. 
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Now the characterization of Hausdorff convergence in terms of the intervals 
reads as follows. 

Lemma 1 Let Pn G ^{11) converge to p = {0 = xq < xi < ... < Xm — i} & 
V{U) in Hausdorff metric. Then we have 

(i) For all e i{p) there is a sequence 0„ G i{pn) such that 0„ converges 
to in Hausdorff metric. 

(a) On = [on; ^n] convcrgcs to Q = [a, b] if and only if an ^ a and bn —>■ b. 

(Hi) Let 0,0' e l{p) two adjacent intervals, i.e = {x,y), 0' = {y,z). Let 
0n = {xn,yn), 0'* = {y'w^n) t^c scquenccs of intervals from (i). Then 
[Vn, y'n\ — ^ {y} in Hausdorff' metric and y'n y uniformly for all 
adjacent 0, 0' e 



Proof: Let 60 := 1/3 min^;^^^? \x~y\ ^-^d = (a, b) G l{p) be fixed. Without 
loss of generality, assume that n is so large that dniPnyP) < ^o- (h) By a < b, 
Qn < bn and by the assumption, we have necessarily |a — a„|, |6 — 6„| < Sq 
and thus by definition of the Hausdorff metric 

dif([a, 6], [a„, = max < max min \x — y\, max min |x — |/| > 

\^xe{a,b} yG{an,b„} y&{an,bn} xe{a,b} J 

= max{|a„ - a|, |6„ - 6|} . 

(i) We now construct the sequence 0„ by letting 0„ = (a„, 6„) where 

Qn '■— max{a:; & Pn '■ x < a + 5o} bn '■— min{a:; & Pn '■ x >b — 5q\. 

By the assumption made above, both sets are non - empty and by Hausdorff 
- convergence we have a„ — > a, 6^ — > b. Thus, by (ii), 0„ converges to 0. 
(iii) By construction of the sequence in (i), always yn < y'n- But by (ii), 

yn, y'n^y and 

\yn - y'n\ < \yn -y\ + \y- y'n\ < ^dniPn,?)- 

■ 

As a first application, we derive that the counting function is lower semi- 
continuous. 
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Corollary 1 The function | — | : V^U) — N zs lower semi- continuous if 
V{U) is equipped with the HausdorfJ topology. 

Proof: The intervals G„ G L{pn) approximating a given interval G i{p) 
constructed in Lemma ^ can be chosen disjoint for different B. The left 
boundary points of the approximating intervals are elements of p„ and all 
different. Thus > bl- ■ 



2.4 Functions of Special Bounded Variation 

Now we consider a special segmentation class, namely functions which are 
of special bounded variation. They are defined as follows: Recall that a 
right-continuous function of bounded variation / G BV(f/) with left limits 
defines a signed measure u G Ai{U) which is uniquely determined by it 
values i/([a, 6)) := f{b) — f{a) on half-open intervals. Recall further, that by 
Lebesgue decomposition (see for instance [17j, Theorem 1.13, 1.14, p. 22) there 
are three uniquely determined measures psi^Ri where u = us + i'r + 
and 

(i) i^s = "^Cx 6x is a sum of point measures, 

(ii) upi « A is absolutely continuous with respect to Lebesgue measure 
with Radon-Nikodym derivative f = duji/dX, 

(iii) I/-*- ± A is singular to Lebesgue measure without point measures, i.e. 
i/-^{{x}) = for all X eU. 

Definition 9 (1) A function f G BV(f/) is called to be o/ special bounded 
variation, i.e. / G SBV([/), if in the decomposition above we have = 0. 
It is called to be o/ p-special bounded variation, i.e. / G SBVp([/), if in 
addition, f G L^{U) and - in contrast to the usual convention - if the 
support supp (z/5) is a finite set. (2) The partition associated to a function 
f G SBV2(f/) is given by p{f ) := supp (1^5) U {0, 1}. 

It is not yet obvious that these functions form indeed a segmentation 
class in the sense that they are equivalent to piecewise continuous functions 
as assumed in Definitional We will show this for p = 2 and will characterize 
them as piecewise Sobolev-functions. 
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Lemma 2 (1) Let f G SBV2(f/). Then there are (up to an additive con- 
stant) uniquely determined functions t G T{U), the space of step-functions 
introduced above, and F G H^{Vt), the Hilbert-Sobolev space of one time 
generalized differentiable functions with square integrable derivative on the 
open interval Q := (0,1), such that f = t + F. (2) In the equivalence 
class of F E H^{Q), there is a continuous representative as well denoted by 
F G C{U). (3) The partition p{f) associated to f E SBV2(f/) coincides 
with disc(t) U {0, 1}, where disc(t) denotes the points of discontinuity of the 
corresponding step function. 

Proof: (1) Let Df denote the distributional derivative (measure) of /. By 
assumption Df = /' ■ A + us- Then the distribution function of the singular 
part Us is a right continuous step function with left limits t G T{U). Hence 
D{f -t) = f -X with l/f < oo. Hence F := f - t e H\n). (2) The 
second statement follows from Sobolev's embedding lemma (see |15j, Thm. 
IX. 24, p. 52). (3) This follows from the fact that the step function is the 
distribution function of the point measure. ■ 

Thus, functions of special bounded variation are indeed right-continuous 
with left limits and hence form a segmentation class. 

3 The Reduction Principle 

The reduction principle consists of the fact that both, the Mumford - Shah 
and the Blake - Zisserman functional - except for the degenerate case 7 = 
where this property is still true in a restricted sense - have the following 
property: 

Among all segmentations associated to a fixed partition of the interval, the func- 
tional assumes a unique minimum. The minimizer for a fixed partition can be 
computed by separately minimizing independent functionals associated to the in- 
tervals of the given partition. 

Hence - by the first property - the problem of minimizing the whole func- 
tional can be reduced to the problem of minimizing a reduced functional 
which is a function of partitions rather than segmentations. Then, simple a 
priori bounds on the number of jumps can be used to show that this partition 
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space is essentially compact. That, in particular, provides the existence of 
global minimizers. Considerations like that are the subject of this paper. 

The second property is more important from the algorithmic point of 
view. For discrete time-series and the Potts model this property was used 
to establish an efficient algorithm to compute the minimizers, see e.g. [22] 
or the PhD-Thesis jH]. In the very recent PhD-Thesis jS], this property is 
used to construct efficient algorithms in 2D when the admissible partitions 
are restricted to certain subclasses (cf. the formulation of the reduction 
principle in Definition 1.2.1). 



3.1 The Reduction Principle for Mumford - Shah 

We start with the statement of the reduction principle for Mumford - Shah. 
First of all, we give the exact definition of the functional already discussed 
above extending it to a functional on L^(f/). Recall that functions of special 
bounded variation provide a segmentation class as explained in Section ITU 

Definition 10 Let g G L'^iU), 7,yU > 0. The Mumford - Shah functional 
MS^^^_g : L^{U) ^ M to signal g and parameters 7,/i is given by 

MS, an - 1 '"'-'^ ^ * ^" ^"'"^ ^ - { ' (1) 

1^ 00 else 

where j{f) := \p{f) \ — 2 is the number of jumps. 

As proper segmentations of the signal g, we consider minimizers of the 
functional, i.e. 

f*,^.,oo,g ■= argmin^g^2(c;)MS^,/,,g(/). 

The minimizer is not necessarily unique. The starting point for the reduction 
principle is the fact that we may split up the minimization procedure by the 
following observation: Let 



/;,oo,c;(p) = argmin|^^p(^)cp| 



4 / \ffds+\\f-9\ 
Ju 



(2) 



The key point, however, is that this minimizer for a fixed partition exists, is 
unique and can be computed explicitly due to a decoupling of the minimiza- 
tion procedure for the different intervals in the complement of the partition. 
Then, the global minimizer is given by 

f^,,i,oo,g = argmm^g|..^^ (p)^pgp(^)|MS^,^,g(/). (3) 
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That means, the minimization of the functional can be reduced to the mini- 
mization on a much smaller subspace of the space of all segmentations. This 
subspace is an image of the space of partitions under the (in general not 
injective) map p ^ f*,^^g{p). 

First of all, we compute the unique minimizer in equation (j21). 

Proposition 2 Let 7, /i > 0, p G V{U) be fixed. Then the unique minimizer 



\fVds+\\f-g\ 



can he constructed as follows: Let G i{p). Denote by Ae the Laplacian on 
9 with Neumann boundary conditions. Then the function 

fe:=-fi'R{Ae, 12^)9 

where i?(Ae,yU^) denotes the resolvent of the Laplacian, is continuous on 
0. Denote by G"*", the closed interval O with right boundary point removed. 
Then 



Remark, (i) Note that pUl.,oo,gip')) C p' but that p{f^^oo,gip')) can be 
strictly smaller. That explains why the map p ^ f^oogip) is not injective 
and was the reason to put {/ : p{f) C p} in equation (ii) Having in 
mind the decomposition L'^iJJ) = ©0et(p)-/^^(0'^) we could write shortly 

f;,oo,gip) = Ri^e,fi')9 

0et(p) 

where we identify R{Aq, fj,'^)g G H^{Q) with its image in C{Q~^) according 
to the Sobolev embedding theorem. The second property of the reduction 
principle is reflected by the fact that the operator assigning to the signal 
g the minimizer f^ oo,gip) decomposable (see for instance [THI, p. 284 ff.) 
with respect to the orthogonal sum decomposition L'^iJJ) = ©eet(p)-^^(0''')- 
(3) Note that the case = is included in the preceding considerations 
in the following sense: If /i = 0, the penalization for SBV-functions with 
non - vanishing derivative tends to infinity, hence constant functions /e are 
favored in this case. According to this, the operator — /i^ R{Aq, fi"^) tends to 
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the projection onto the kernel of Ae which consists of all constant functions 
on 9. 



Proof: Minimizing the Mumford - Shah functional for a fixed partition 
means minimization of the expression 



E / dx{f^-'\Q' + \fe-9\') 



Te=0 



eet{p) 

where we first consider /e 0gt(p) to be a tuple of functions /e G C°°(0). 
Let /ie,eet(p) be another such tuple and re,0et(p) E M}'-^^^^. The minimization 
condition reads 

= ^ J^dx{fi-'\f^ + Teh'^\' + \fe + Tehe-9\') 

= 2 / dx{ft-'f^h'^ + fe-g) 
Je 

= 2{-fi-^f^,h'^) + {fe,he)-{g,he)). 

Hence, adopting the notation from [Idj, Section 5.7, p. 345 ff, we may con- 
sider the extended map —^"^Cn + 1 from H^{Q) to the dual space H^{Q)* 
defined by the relation 

{{-fi'CN + l)/e, he) := h'^) + (/e, he). 

By [IS], Proposition 7.2, p. 346, for /i > 0, (7 G L'^{U), the equation is solved 
by a unique fe G H'^{Q) satisfying 

-fi'fe + fe = 9 one 
felds = 

where the second equation is understood in terms of the trace map (see 
[in]. Proposition 1.6, p. 273. Thus, the Euler equation can be equivalently 
described by (Ae — /U^)/e = —fJ''^g where Ae is the Neumann laplacian on 
0. Hence 

fe = -fi^Ae-fiY'g = V^(Ae,/i')^7 
where R denotes the resolvent. By Sobolev's embedding theorem, 

/eGdom(Ae)Cifi(e)cC(e), 



18 



and /* provides indeed a right-continuous solution. In the case /i = 0, the 
minimum can only be assumed by locally constant functions, i.e. /@ = 0. 
Hence, in that case the only non-trivial variations of /e are given by constant 
functions he as well. Thus, the solution to the variational problem is given 
by the constant function assuming the mean value of g on 0, i.e. 

(/e, he) = fehei^, 1) = {g, he) = he{g, 1) 

and therefore fe = 1). The minimizer depends thus even con- 

tinuously from /i, since the resolvent function considered above tends to the 
projection onto the kernel of the Dirichlet Laplacian as /i tends to zero. Since 
the kernel consists of constant functions, this coincides with the mean value. 



Hence, the precise formulation of the reduction principle in the case of 
the Mumford - Shah functional is given by 

Corollary 2 (Reduction for Mumford-Shah) Let 7,/i > 0. The mini- 
mization of the Mumford - Shah functional is equivalent to the minimization 
of the reduced Mumford - Shah functional ms^^^^g : V{U) M given by 

ms^,^,g(p) := MS^^^^g{f*,^g{p)) = -fj{p) - {u, f*^^^g{p)) + \\gf, 

more precisely, p is a minimizer ofms^^^^g if and only if f^^oo,gip) ^-^ ^ mini- 
mizer of MS^^^^g. 



3.2 The Reduction Principle for Blake - Zisserman 

The reduction principle for the Blake - Zisserman functional is similar to the 
one for Mumford - Shah. We just have to identify the respective quanti- 
ties in the discrete setting. Again, we start by giving our definition of the 
functional. Recall the notions of equidistant setup and conditional expectation 
from Definition n and the classes of considered step-functions from Definition 

m 

Definition 11 Let n E g e L'^{U) and 7,/i > &e fixed. The Blake- 
Zisserman functional BZIJ^^ : L'^{U) ^ M zs given by 

BZ- (f).= S'^Uf) + ^T,:-Jo{f-9n)\-)fe%{U) 
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where 

n-2 

and Qn = E{g \ an) denotes conditional expectation. 

Again, we seek for minimizers. The starting point for the reduction prin- 
ciple in this case will as well consist of an observation concerning the mini- 
mization of the functional. Let 

:= argminBZ!;,^,^(/) 

be a (not necessarily unique) minimizer. Then we have 

Lemma 3 The minimization of BZ" is equivalent to the minimization of 
the functional HJ^^g : 1j2~^ ^ Tn{U) ffi given by 

rt— 1 

where Z2 := {0, 1}, e = (ei, Cn-i) and 

n-2 

K=0 

Proof: See |22|, p. 36 f. ■ 

The points ^ such that = 1 correspond to edges in the segmentation 
of Un (see again the presentation in [22], P- 36 f). They correspond exactly 
to those points ^, where the minimizer fulfills |/*_^ „^^(is±i) - /*^^_„^g(f^)| > 
/i \/'y/n, i.e. they represent the discrete discontinuities of /^,^,„,c, with respect 
to the threshold T = /x \/'y/n as defined in Definition IHl Furthermore, there 
is a bijection onto associated partitions 

Pie):={f^ : = 1} U {0, 1} G K(t/) 

and these form the discrete analogue of the partitions in the continuous case. 
Using the lemma above, the starting point for the reduction principle is 
provided as in the continuous case by the following observation: Let 

n—2 ^ n— 1 

K=0 K=0 
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and 

/;,n,,(p) = argmin^^^p^(^)cp|Q;:(e, /). (6) 

Again, the minimizer for a fixed partition exists, is unique and can be com- 
puted explicitly due to a decoupling of the minimization procedure for the 
different intervals in the complement of the partition. Then, the global min- 
imizer is given by 

/7,/.,n,g = argmin^g|^,^^(p)^pgp^(^)|BZ"^^g(/). (7) 

First of all, we recall the decomposition of the conditional expectation 
map that was already introduced in Section 11.21 

Definition 12 The map 7r„ : L'^{U) is given by the row 

T^ng ■■= igniO),gnil/n), ...,gn{n - I / u)) 

where gn '■= E{g \ cr„) denotes conditional expectation. The map r„ : M" — >■ 
L'^{U) is given by 

n-2 

K=0 

Remark. Clearly, gn = '^n ° T^ng- 

As in the continuous case, we start by constructing the unique minimizer 
associated to a fixed partition. 

Proposition 3 Let 7,yU > and p G Vn{U) be fixed. Then the minimizer 
/;,n,,(p) = argmin^^^p^(^)cp|Q;:(e,/) 

is unique and can be constructed as follows: Let p = {0 =: Ho/'^ < Hi/n < 
n2/n < ... < Kijn < 1 =: /ci+i/n} G Vn{U). Consider the block matrix 



A{p) 



( B{1) \ 

V B{l + l) ) 
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where the blocks B{r) are given by the Hr — ^r-i x — band matrices 

\ 



/ -1 1 
1-2 1 



B{r) 



\ 



1 -2 1 
1 -1 



Then 



f*^,n,giP) ■= -fJ'^'Tn ° R{n'^ A{p) , fl"^) O 7r„^* 

where Rln"^ A{p) , fi^) denotes the resolvent and T^nQ^ the transpose vector. 



Remark. As in the continuous case, the block structure of A{p) corresponds 
to a direct sum decomposition of underlying space and operator. 



Proof: Analogous to the proof of Proposition |21 minimization of BZ for fixed 
partition e, writing shortly = /(^), equivalent to 

K=0 K = 

= ^2((r+i - r)(i - e.) + (r - r-'){i - e.-O) + -(r - gnifj) 

for all K = 0, n — 1. This system of linear equations can be written as 

{n^A{p) - fi^)l = -fi'^Tir.g', 

hence 

Kna = ^"(/) = -f^^rn{n^A{p)-fiy^'K^g' = -fi^T^R{n^A{p), ^x^) Tir^g'. 

■ 

Hence, we obtain in analogy to Corollary |21 
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Corollary 3 (Reduction for Blake-Zisserman) Let 7,/i > 0. The 
minimization of the Blake - Zisserman functional is equivalent to the min- 
imization of the reduced Blake - Zisserman functional bz"^^ : V{U) —>■ M 
given by 

For p G Vn{U), the reduced functional is given by 



bz"^,g(p) = 7 J(P) - {9n, flnM) + \\9n 



|2 



4 Continuity of the Reduced Functionals 

So far, we have seen that the minimization on the function space can be 
reduced to a minimization on partition space. However, it is not yet clear 
whether the reduced functionals ms and bz depend continuously on the pa- 
rameters and/or the partitions. In order to prove this, we first have to 
investigate the continuity properties of the minimizers for a fixed partition - 
from now on shortly denoted partition solvers - introduced in the preceding 
section. 



4.1 The Partition Solvers 

We consider continuity properties of the partition solvers. It turns out that 
they depend continuously on the parameters. Since the partition solvers for 
the Blake - Zisserman functional can not be applied to all partitions, we first 
have to define a proper domain. 



Definition 13 Recall the definition of 

T := {1/n : n G N} U {0} C M 

from Theorem^ Adopting the convention that Vi/o{U) =V{U) let 

E(T) := {{t,p) G T X V{U) : p G Vi/t{U)} C M x V{U) 

equipped with its relative topology, i.e. all accumulation points are of the 
form {0,p), p G V{U). 
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With these conventions, the result about the parameter dependence of 
the partition solvers reads as follows. 

Theorem 2 For fixed g G L?'{U), i/ie partition solver map /* : ]R(|"xE(T) — »• 
L'^iU), given by 

f*{fi,t,p) := f*,i/t,g{p) 

is continuous. 



Proof: The statement follows from the series of Lemmas proved below. By 
Lemma El IHl /* is continuous in E(T) for fixed fi. By Lemma |3J /* is 
equicontinuous in fi for all converging sequences of partitions. That implies 
joint continuity. ■ 

Before we come to the lemmas implyint Theorem El we apply the theorem 
to prove lower semi - continuity of the reduced Mumford - Shah functional. 

Corollary 4 The reduced functional ms^^^^g is lower semi - continuous. 



Proof: By Corollary^ the function j : V{U) — N is lower semi - continuous. 
By TheoremEl /* depends for fixed 7, /i, t = continuously on p G V{U). By 
the explicit form of ms-y^^^g given in Corollary [21 that implies the statement. 
■ 

The first lemma states that a family of partition solvers associated to a 
convergent sequence of partitions depends equicontinuously on the parameter 
/i > 0. 

Lemma 4 (i) Letp G V^U) be fixed. Then the map Goo '■ ^0 L^{U) given 
by Goo(yu) := /^,oo,g(p) Lipschitz continuous, (ii) Let p G VniU) be fixed. 
Then the map Gn{^^) '■= f^ngip) '■ ^0 ~^ -^^(^) ^■^ Lipschitz continuous. 
(Hi) Let Pn.neN be a family of partitions such that Pn G Vn{U) converges 
to p E V{U) with respect to Hausdorff metric. Then the associated family 
Gn,n=i,2,...,oo Uniformly Lipschitz. 
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Proof: (i) Recall the notations from Proposition El By definition, all parti- 
tions are finite. Thus it is enough to show Lipschitz continuity for one single 
interval, i.e. for a subinterval 9 := {x,y) G t{p), the map 

/e(/i) ■.= -fi'R{Ae,fi')g 

is Lipschitz continuous. Since the Neumann laplacian is self-adjoint and has 
a discrete spectrum with semisimple non-positive eigenvalues and its kernel 
consists exactly of constant functions, we have the spectral decomposition 



-fi^R{Ae,fi^)g = Eog + J2 



S>1 



/i2-A,(e) 



Esg 



where E^g denotes the mean value of g in Q, i.e. the orthogonal projection 
of g onto constant functions. Hence, by > Ai > A2..., we obtain 



||/e(/i)-/e(/i') 



< 



E 

S>1 



12 



Esg 



-(/.V^-A,(e)(/.2 + ^'2) + ^2(0)). 



S>1 

2^^T77^\\Esg\\ < 



\Esg\ 



2\2 



s>l 



A^(0) 



A?(e) 



Hence /e is Lipschitz continuous with Lipschitz constant Ce := ||(7||/|Ai(0)| 
depending on the norm of g and on the spectral gap |Ai(0)| of the Neumann 
Laplacian on G. That implies 



'2 I 



\GM - GMW < C\fi^ - ^i' 



where C := maxegt{p) Cq < 00. (ii) Recall the notation from Proposition El 
By definition, all partitions are finite. Thus it is enough to show Lipschitz 
continuity for one single interval, i.e. for a subinterval 0^ := [nr-i/n, Kr/n) C 
U and the corresponding map 



/e.(/u) := -/uV„i?(n^5(r),/i^) 7r„5(*. 
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B{r) is a symmetric matrix and its kernel consists exactly of constant vectors. 
Thus, we have the spectral decomposition 

^ fi^ — n^Xs[B{r)) 

where PqU denotes the orthogonal projection of onto constant vectors. 
Hence we obtain using the fact that conditional expectation is a projection 
(which implies \\gn\\ < llfi'll) 



ll/e.(/^) - /e.(M 

E 



/M|2 



2 /2 H 2 



/i2 - n2A,(5(r)) /i'2 -n2A,(5(r)) 



iTnP^TCng 



t||2 



where \Xi{B[r))\ is the spectral gap of B{r). By the same argument as above, 
Gn is Lipschitz continuous, (iii) By the preceding parts of the proof and by 
Lemma n it is enough to show the following: 



Let G„ := [kI~^^ /n, K^J^i/n] be a sequence of intervals tending to G := 
Hausdorff metric. Then the spectral gap \\i[B'^'^\r))\ of the associated block 
matrices is uniformly bounded below. 

Note that the case k"*" = i.e. G = {k^}, is included in the following 
considerations. First of all, by Lemma ^ Hausdorff convergence G„ — G is 
equivalent to limKr"''/n = limK^'^\/n = . That implies 

(n) (n) 

_,_ - , / -In 
= K —K + o[n ). 

n 

On the other hand, the eigenvalues of B^^\r) are well known to be 

A, := 2 f cos ( ^(£^^ - 1 



m 



where s = 1, e.g. Iin], Theorem 1.3). Thus, the largest 
non-zero eigenvalue is given by 



Ai(5(")(r)) = 2(cos (;-^) - l) 
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and hence 

n2Ai(E(")(r)) = 2n^ (^cos(^^^^|-^) - l) = 2n^ (^cos( 

r^ + o(n-i). 



n(K+-K_+o(n^l)) ' 



That imphes finally that for all e > there is some no with 



for all n > uq where L is the maximal length of an interval in l{p). Hence, 
the family Gn is uniformly Lipschitz. ■ 



The next result shows that the partition solver associated to the Mumford 
- Shah functional depends continuously on the given partition. 

Lemma 5 The function f^^ao,g • '^(U) SBV2(f/) C L'^{U) is continuous 
ifV{U) is equipped with the Hausdorff topology. 



Proof: First of all, recall that there is a strongly continuous representation 
of the ajjine group Aff(M) := {La,bix) := ax + b : a > 0,6 G R} on L^{R) 
given by p{Lafi)f ■= f{ax+b). In particular, given a sequence (a^, bg) tending 
to (1, 0) as s tends to infinity, we obtain 

hm ||p(L,^,,J/-/|| = 

for all f E L"^ (M) . By Lemma ^ it is again sufficient to prove that for a 
sequence 9„ := {k^I'\k^^^) of intervals such that 9„ := in 
Hausdorff distance, we have 

lim \\^I^R{Ae,^i')g-^I'R{Ae„,^^')g\\=0. 

n—foo 

(i) Let 7^ K_. and therefore Ln{x) := WnX + f „ : R ^ R the unique linear 
map such that Ln{K^- ^) = k_ and L„(k^^) = Now L„ maps the domains 
of the corresponding Neumann laplacians, i.e. we have (7 o L„ G T>{Aq^) 
g e ^'(Ae), and furthermore 

^eAg ° Ln) = wl (Aeg) o L„. 
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In the spirit of the remark made above about the representation of the afhne 
group, we write Pn{f) •= / ° -^n- In order to avoid difficulties with the 
domains, we wiU write ge for the restriction gxe of a function g to the 
interval having in mind that due to the support of the resolvent kernel, we 
always have that 

R{Ae, fJ,'')g ^ R{Ae, f/)ge 

is supported on ©. Now 

R{Ae„,fi^){ge o L„) = w-^ {R{Ae, w-^fi^)ge) o L„ 

and by Hs'eo^nll = Wn^^^WgeW and \\n'^R{Ae, j^"^)]] = 1 for all > 0, 6 e 
we therefore obtain letting g{n)Q := ge^ o = Pn^igeJ 

\\fi^R{Ae, p^)ge - p^R{Ae„, I2^)ge^ \\ 
= ||/i^i?(Ae, /i^)fi'e - (/i^w^^i?(Ae, iJ,'^w~'^)g{n)e) o L^W 

< Wii^RiAe, i?)ge - {i?R{Ae, i?)ge) o L„|| 
+ ||(/x2i?(Ae,/i')(^0 - g{n)e)) o L„|| 

+ II ((/x2i?(Ae, - /x'w-'i?(A0, p''w-''))g{n)e) o L„|| 

< ||//'i?(Ae,/x')(?e -p„(/x'i?(Ae,/i=^)(7e)|| 
+ ||p„ (/i2i?(Ae,/i')((?e-^7We)) II 

+ \\pn {{p^R{Ae, p') - p'w~^R{Ae, fi'w-^))g{n)e) \\ 

< \\p^R{Ae,p^)ge-Pn{p^R{^e,P^)9e)\\ 

+ \\Pn\\ he - 9{n)e\\ + ^p^PnW \\9{n)e\\ |1 - ^"^1 

using the resolvent identity for the final step. Note that 

\\9e-9{n)e\\ = he - Pn\9e) + Pn\9e) - Pn^geJW 

< lbe-p;'(ye)|| + ||p;'(^G-yGj|| 

< lb0-p;^(^0)|| + ||p;:^||||(X0-XeJ^|| 

and that implies the statement by Vn) — > (1, 0) and strong continuity of 
the representation, (ii) If, on the other hand, e. the sequence 

tends to a single point, we have, again by contractivity of /i^i?(Ae„, //^), 

\\p'RiAe^,p')geJ\ = \\p'RiAe^, p')gej\ < \\9eJ 
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which tends to zero as n tends to infinity. 



Since the sets VniU) are discrete, we do not have to prove a corresponding 
result for fixed n ^ oo. It remains to show convergence for partition solvers 
to partitions p„ G VniU) that converge to a partition p G V{U) which is not 
contained in Vn{U) for any n. 

Lemma 6 Let pn G Vn{U) be a sequence of partitions converging to p & 
V{U) with respect to Hausdorff metric. Then 

lim /;,n,g(Pn) = /;,oo,g(p)- 

n — ^oo ^ ^ ^ ^ 

Proof: Recall the notations from Corollary I2l3t respectively. By Lemma 
HJ the proof consists of considering the following two cases: (i) Let 9 = 
K+) G l{p) and 6„ G i(p„) such that 6^ ^ 6. Then 6„ = ^^^) 
and we have to show that for g G L'^{U) 

as n tends to infinity. We observe that, by cancelling — /i^, this can be done 
by proving strong resolvent convergence of the operator 

An ■■= TnU^Birn) O 7r„ 

to the Neumann laplacian Ae. That follows from 

/i"^(A„-/i^) = /i~^(r„n^5(r„)7r„-/iV„7r„ + /i^(r„7r„-l)) 
= r„/i~^(ra^5(r„) - /i^) 7r„ - (1 - r„7r„,) 

where r„ 7r„ is nothing else but conditional expectation with respect to an on 
the subinterval '"''"j/"^ ). Hence for all / G ©(A^), we have by 7r„(l — 

Tn TTn) = 

Tn Riri^Bivn), /i^) 7r„(A„ - /i^)/ 

= Tn TTnf - R{n^B{rn), /i^) Vr„(l - Tn Tin)/ 

= TnlTnf- 

Together with TnTi'nf fe this implies that the resolvents i?(A„,yU^) tend 
to the same limit as the operators TnR{n^B{rn), ii'^) vr„ strongly as n tends 
to infinity. 
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It remains to prove strong resolvent convergence of A„ to Ae. By jH], 
Corollary 1.6, p. 429, and since {/ G C°°(9) : f'loQ = 0} forms a core for 
the Neumann laplacian, we just have to prove that 

hm ||A„/-A©/|| = 

n—>oo 

for all / in the core. Let thus 

f{x) = f{xo) + {x- xo)f'{xo) + ^f"{xo){x - XoY + R{x) 

be a Taylor expansion of / with remainder |-R(x)| = 0(|a;|^) uniformly for 
X in a compact interval containing 9. Then, controlling the error by Taylor 
expansion yields 

An/ = E (/"(-) + 0(l/n))x[.,^) 

-{nf'{K^) + 0(l))X[^_:Eni±i) + {nf'{K+) + 0(l))Xr --n+i-i -r„+, . 

Hence, the boundary conditions = f'{i^-) = imply strong conver- 

gence of i?(An,/i^) to R{Ae,fi'^) uniformly on compact sets /i G [a,b] C M^, 
6 > a > 0, contained in the resolvent set. By Lipschitz continuity estab- 
lished in Lemma|3]this extends to uniform convergence of —fi'^R{An,fi'^)g to 
— /i^-R(Ae, for /i G [0,6]. (2) For a sequence of intervals collapsing to a 
point, uniform contractivity ||/i^-R(A„, < 1 implies convergence of the 
corresponding partition solvers to G L'^iU) as in the proof of the preceding 
lemma. ■ 



5 Compactness of the Set of Minimizers 

By the reduction principle, the set of partition solvers to a fixed function 
g G L'^{U) contains the set of minimizers of F. Now we will prove, that 
the set of partition solvers is compact, which implies the same for the set of 
minimizers. Let g G L'^{U), b > and 

Mt{g) ■■= {f;,n,gip) : pGP„(f/),/xG [0,6],nGNU{oo}}. 
In this section, will prove 
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Theorem 3 For all b > and g E L'^{U), the set Mb{g) C L'^{U) IS com- 
pact. 

This resuh imphes the existence of global minimizers. 

Theorem 4 (Existence of Minimizers) Let 7,/i > and u E L?'{U). 
Then: 

(i) The set of global minimizers f* o/MS^,^,g is non-empty, 
(a) The set of global minimizers f* o/BZ"^^ is non-empty. 

Proof: (i) Recall that the reduced Mumford - Shah functional is given by 

^^7,f^Ap) = 7j(p) + yf - {g, fl,oo,gip))- 

By Theorem 121 minimizing MS^^^^g on L'^{U) is hence equivalent to the min- 
imization of 

7j{p{f)) + \\9\\'-{f,9) 

for functions / in the compact set Ai^{g). For 7 = 0, this function is 
continuous and hence assumes its minimum on the compact set. For 7 > 0, 
the function is lower semi - continuous by Corollary ^ and assumes hence 
as well its minimum on a compact set. (ii) The set Vn{U) is finite, hence 
the minimization can be reduced to the minimization with respect to a finite 
number of partition solvers which implies that the functional assumes its 
minimum. ■ 

Now we start with the proof of Theorem El 
5.1 A Preliminary Lemma 

First of all, we will prove a lemma that will simplify the discussion consider- 
ably. It states that a proof of compactness for all g E L'^{U) can be reduced 
to a proof for a total subset of signals in L'^{U). Denote therefore by 

Mb ■■= {g E L'^iU) : Mb{g) C L'^iU) compact} 

the set of all signals, such that M.b{g) is compact in L'^{U). Then we have 
the following statement: 



31 



Lemma 7 For all b > 0, the set Aih C LF'iU) is a closed linear subspace. 

Proof: (1) Let gi,g2 G Mb, Ai, A2 G R. Then, by linearity of the partition 
solver 

and hence Mbi^iQi + ^292) C \iMh{gi) + )^2Mb{g2)- The right hand side is 
compact by continuity of the linear operations on L'^iU). (2) Let g^ e J^h be 
a sequence of signals converging to g ^ L'^iU). The mappings g 1— > f^^n,gip) 
are contractions. Hence 

\\fm(P) - f^,n,9iP)\\ ^ Wdk - 9\\, 

i.e. the sets Aibigk) converge to A4b{g) as closed subsets of L'^{U) with re- 
spect to Hausdorff metric. That implies that M.b{,g) is compact, and therefore 
Alft is closed. ■ 

The preceding lemma implies immediately, that if M-i, contains a total set, 
i.e. a set of functions such that its closed linear hull equals L'^{U), it already 
equals the whole space. Such a set is provided by the Heaviside functions on 
U. 

5.2 Compactness 

Denote by 

nU) := {xa ■■= X[a,i] : < a < 1} C L\U) 

the set of Heaviside functions on the interval U. The linear hull of Hi^U) is 
provided by the step functions which are dense in L'^{U), hence 'H{U) forms 
a total set. The key result for compactness reads 

Proposition 4 For all Xa G ^(^) we have 

MiXa) ^ {f & MbiXa) : |p(/)|<6}, 

and M-biXa) is compact. 

Proof: Let p = < xi < ... < Xr < 1} be an arbitrary partition such that 
a e [xk,Xk+i). Then, Xa and Xa,n E{xa\crn) are constant on [0,Xk) (equal 
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to zero) and [x^+i, 1) (equal to one). Since the kernels of Ae, A„ are provided 
by constant functions, vectors, respectively, the respective resolvents map the 
conditional expectation Xa,n onto functions which are constant (and equal to 
zero) on [0,x„_i) and constant (equal to one) on [xk+2, !]• That implies, they 
have at most six jumps points, contained in the set {xk-i, Xk, Xk+i, Xk+2} U 
{0,1}. By Lemma ID (2) and Corollary [2 the set {p E V{U) : \p\ < 6} is 
compact as a closed subset of a compact set and hence the closed subset 

C := E(T) n T X {p G V{U) : \p\ < 6} 

is as well compact. That implies finally by Theorem |21 compactness of 

-M6(Xa)cr([0,6]xC). 



Remark. Recall that by Definition the notion of jump differs from the 
notion of discontinuity in the discrete setting. 

By Lemma [7[ that completes the proof of Theorem |21 

6 F-Convergence and Weak Derivatives 

The notion of F-convergence is important for the investigation of minimiz- 
ers. It provides some rather general sufficient condition for the possibility to 
approximate minimizers of a given functional by minimizers of a sequence of 
approximating functionals. 

Definition 14 Let X he a metric space and Fj : X —>■ be a sequence of 

functions. Then, Fj converges to F in the sense of T -convergence - in the 

p 

sequel we will write shortly Fj — > F^o - if 
(i) For all X E X and all sequences Xj —^xwe have 

Foo(x) < liminf F,(a;j). (8) 
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(ii) For all X E X there is a sequence Xj — ^ x such that 

Foo{x) > limsupFj(xj). (9) 

Essentially, F-convergence is important due to the following facts (cf. Theo- 
rem 5.3.6 of IH) 

p 

Theorem 5 Suppose Fj F^o and denote by argminF the set of minimizers 
of F. Then 

(i) For any converging sequence Xjj(z^, xj G argminFj, we have necessarily 
limj^oo Xj G argmin F^o ■ 

(ii) If there is a compact subset K G X such that ^ argmin C K for 
large enough j , then argmin Fqo ^ and 

lim (i(xj, argmin Foo) = 

for any sequence Xjj^^, Xj G argminF,-. 
(Hi) If, additionally, argminFoo is a singleton {x^o} then 

for any sequence Xjj(zfq with Xj G argminFj. 

In the sequel, we will prove that the Blake - Zisserman functionals BZ" 
converge to the associated Mumford - Shah functionals MS in F— sense as 
n tends to infinity. By the compactness results established in the preceding 
section, this implies convergence of the associated minimizers. 

The crucial step will be the understanding of the behavior of the Blake - 
Zisserman penalty (see Definition [TT|) as n tends to infinity. For reasons 
that will become clear in the sequel, we prove a parameter dependent result. 

Proposition 5 Let 7„, /i„ sequences of non - negative numbers converging 
to 'y,fi > 0. Then the following two statements are valid which imply F- 
convergence of ^ to the Mumford - Shah penalty to parameters 7,/i.' 
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(i) Let f = t + F & SBV2(f/). Then there is a sequence fn,nm converging 
to f in LF'{U) such that 

71-2 ^ „i 

limsup5^min{^(/„(^)-/„(^))2,7n}<7j(/)+/""' / 

(^izj Lei / G L'^iU). For all sequences fn,nm converging to f in L'^iU) we 
have 



n-2 



liminf ^min{-^(/„(^) - /„(^))2,7„} 



n— »oo 

K=0 



> 



oo else 



A'^ofe t/ifli in the case fi = 0, the right hand side of both inequalities is only 
finite for f E T{U) C SBV2(f/), since f' = exactly for those functions. 

To prove Proposition we have to collect several facts about the relation 
of weak differentiation and approximation by step functions. This will be 
done in the following two subsection. The final proof of Proposition is 
given in 16.31 

As an introductory step, we discuss weak differentiability in L^(f/). 

6.1 A Characterization of Weak Differentiability 

A function / is Sobolev - differentiable on the one-sphere, i.e. / G H^{S^), if 
and only if / G L'^{S^) and its Fourier coefficients f{k), G Z with respect 
to the orthonormal base 0fc(t) := e^'^*'^* satisfy 

J2k'\f{k)\'<oc (10) 



(see for instance jlHI, Section 3). Although L'^iJJ) and L'^{S^) can be identi- 
fied by considering as the identification U / {0, 1}, the situation is different 
for the Sobolev spaces H^{Vt) and H^{S^) {Vt = (01, ), cf. Definition ITU. By 
Sobolev 's embedding theorem, H^{Q) C C{U), and / G H^{Q) can be iden- 
tified with some / G H^{S^) if and only if /(O) = /(I) for the continuous 
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representative of /. Hence, H^{S^) C H^{Q) is a linear subspace of codimen- 
sion one. In order to prove F-convergence for the Blake-Zisserman functional, 
the first step will be to find a suitable analogue of the characterization (jlOp 
for H\n). 

Lemma 8 Equivalent are 

(i) f e H\n), 

(a) there is some a G R such that f — Qa & H^{S^) where ga{t) = O-t, 
(Hi) there is some « G R such that 



a 



2 

< oo. 



where f{k) denote the Fourier coefficients of f , 

In case f G H^{Q) we have for a; G R as above 

• 1 rl 



J2bmkf{k)-a' = [ \{f-g^y\^dt= [ \ffdt-{f{l)-f{0)f 



hence a = /(I) — /(O). 



Proof: (1) First, we show that (i) implies (ii). Let / G H^{Q). By Sobolev's 
embedding theorem, there is a representative /o of / such that /o G C{U). 
Let (p G C°°{U) with d^ cj) / dt^ {Q) = d^(l)/dt^{l) for all k = 0,1,... and a : = 
/(l)-/(0). Then 

f (f'-a)4>dt = mif~9a)\l-[ {f~9a)<P'dt 

^0 

and hence /' — a is the weak derivative of f — Qa in H^{S^). Since U is 
compact, square-integrability of /' implies square-integrability of /' — a. (2) 
To see that (ii) implies (i), let f - ga e H\S^) and G C^{n) C C°^{S^). 
Then, since ga G H^{Q), 

f f<P'dt = f\f-g^ + g^Wdt = - [\{f - gj + a)<pdt. 
Jo Jo Jo 
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Hence, / is weakly differentiable on Q with square integrable weak derivative 
f — if — Qa)' + Oi- (3) The equivalence of (ii) and (iii) follows from 

f'{k) = /' die-'-''' fit) = /(I) - /(O) + 2mkf{k) 
Jo 

and the equivalence of (i) and (ii). ■ 



Lemma 9 Let a family of moUifiers be given by 

{-X < X < a 
a — 

1 a < X < 1 — a 

1(1 -x) l-a<x<l 

where Q < a < 1/2. Then, if f e L'^iU), we have fha G L'^{U) for all a > 
with 

WfhaW < 11/11 

and fha converges to f in L'^{U) as a tends to zero. 

Proof: Clearly < /ia < 1 and /ia — >^ 1 in L'^{U) as a tends to zero. Hence 

Wfhaf = f ifKdt < f \f\'dt - 

Jo Jo 
and 

\\fiha-l)r< r\f\'dt+ f \f\^dt 
Jo Jl-a 

which tends to zero as a tends to zero. ■ 

These simple observations are the starting point for the constructions in the 
next sections. 

6.2 Approximation and Weak Differentiability 

Given e %i{U), we consider 

F(")(x) := n(/(")(a; + 1/n) - f^''\x)) 
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where we understand /^^^ to be continued periodically to the real axis. Then 

n-2 

Now we rewrite the integral in terms of the Fourier coefficients of F^'"^ with 
respect to the orthonormal base 

{, 2'Kinkt T 

v^e^ >o<t<i-i 
,l-^<t<l 

on L^([0, 1 — ^]). As n tends to infinity, the individual base vectors tend to 
4)k{t) := e^'^*'^* and even the Fourier coefficients converge as we will show in 
the next lemma. 

Lemma 10 Let f^^ E N be a sequence of step functions with /^"^ e %i{U) 
converging to f in L'^{U), and F^") as above. For fixed /c e a e (0, 1/2) 
we have 

lim(F(")/i„,C)-0fe) = O. (11) 



Proof: (1) First of all, we show that 

hm (F(-)/i„,C)-C^) = 

n— »oo 

where 



[12) 



Tl 2ninkt 

e . 



n — 1 



Assume without loss of generality that n is so large that 1/n < a. Then by 
Cauchy - Schwarz inequality 



(n)| 



(»)| 



iJl^dt{l-t)^ 2||/(")|| 
^ a2(n- 1) - ^3{n - 1) 
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This expression tends to zero since, by assumption, ||/^"^|| — > ||/|| < oo. (2) 
Now we have by substitution oi u :— t -\- using periodicity and \ha\ < 1 



< nl dt\{f-\t + l/n)-f^Kt))ha{cP):>-<Pkm\ 



_ fin)/ 



< nl du\f^\u){4>'C\u-l/n)-<P'^>{u) + Mu)-Mu-'^M)\. 
Jo 



But now 



2 



n 
An 



n . Ivikt 1i\iht - 

— ^(1 — e )(1 — e " y 



1 — cos 



27ikt\ 



n — IV n — l) \ n 



1 — cos ■ 



2nikt\ 



Hence 



|(F(«)/^„0i-)-0,)l = o(i; 

tends to zero as n tends to infinity. 



Lemma 11 Let E N be a sequence of step functions with /^"^ e %i{U) 
converging to f in L?'{U), and F*^"^ as above. Let f{k) be the Fourier ex- 
pansion of f with respect to the orthonormal base (pkif) '■— e^'^**^* and ha as 
above. Let 



a,(a,n) ^ (^j'^ f^'^Ukim - j^^ f^^^t,{t)dt 

and ak{a) :— hm„_^oo CKk{(i, n). Then 

hm F^a{k) = 2mkfha{k) + ak{a). 



(13) 



Proof: The function /ia^fe is everywhere left-dijjerentiable with left differen- 
tial 

{ha(t^k)-\t) ■■= lini' 
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Hence, by substituting t + 1/n = m, we have 

F^aik) = n !\f^\t + l/n)- f^\t))K(t),{t)dt 



-n I f'-'^\u){ha(f)k{u) - ha4>k{u - l/n))du. 
'o 



The function ha(f)k is uniformly Lipschitz with Lipschity constant Ca^k- Hence 
\n{ha(pk{u) — ha(pk{u ~ l/''^))! — C'a.fc and thus by dominated convergence 

hm F^a{k) 

n— >oo 

= - hm f f'^KhM'.du 

= - hm /" f''\2'Kikha~K_)<Pkdu 

= -27ri/c hm / /("^/i^^fcrfw + hm / f^'^^-{x[o,a)-X[i~a,i))<Pkdu. 
Convergence Z*^"^ f finahy imphes the statement. ■ 



Lemma 12 Let ak{a) be as above. Let am ^ be a sequence of positive 
numbers such that hm^^oo ctfc(flm) = ctfc 7^ ioo exists for some A; G Z. Then 
this limit exists for all k E and equals aQ. 



Proof: / G L^{U) imphes / G L^{U). On the other hand, there is a constant 
Ck>0 such that |1 < Ckmm{\t\, \l-t\} for aU t G [0, a] U [1 - a, 1]. 

Using (|13|). the statement foUows now from 



\ao(a) - ak(a)\ < 



a 

< Ck 



\f{l-^k)\dt+ / \f{l-(Pk)\dt 



l-a 



^\f\dt+ f \f\dt 



which tends to zero as a — > 0. 

From this considerations, we obtain, having in mind inequahty 
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Lemma 13 Let /*^"^ G Tn{U) a sequence of step functions converging to f 
in L'^(U), F*^") as above. Then we have 



K=0 



Proof: We have by Lemma IHl for all a > 



Jo 



fcez 



and all summands are non-negative. Hence by Fatou's Lemma and Lemma 
ITUlandfTTl we have for all a > 0: 



liminf V > Vliminf 

= |ao(«)P + ^ \^27rikfha{k) - ak{a] 



Vliminf |FW/r„(A;) 



By Lemma 1121 either liminfa^o |cto('2)| = oo, or for all sequences am — *■ 
for which the limit lim^^oo |cto(t*m)P = Ic^oP exists, we have as well 
limm^oo |«fc(«m)P = l^oP for all k E Z. Taking such a subsequence, we 
obtain by Fatou 's Lemma and Lemma IHl 



lim inf 



> lim inf 

m— >oo 



'{F^''\x)Ydx 



' + J2\^^^kfhZik) 



> lim |ao(am)p + V' lim \2Tiikfha^{k) 



> lim |ao(a.m)P + 1™ 27rikf{k) 
= [app + |27rifc/(A;) — qq 
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By Lemma IHl / G H^{Q) if and only if there is some a G M such that the 
sum on the right hand side is finite. Hence, if / ^ H^{Q), the right hand 
side is always infinite. If / € H^{Q), f has a continuous version and thus 
lima^o C(o{ci) = /(O) — /(I) which implies by Lemma|Hl that the limes inferior 
equals / \f'\'^dt. m 



The result corresponding to inequality reads as follows: 

Lemma 14 Let f G H^{Q), := £'(/|cr„) the conditional expectation with 

respect to the sigma-algebra an and F^'^^ = n(/(")(x + l/?2) — /(")(x))as above. 
Then 

limsup /" " \F^''\x)\^dx < [ \f'ix)\^dx. 



Proof: By the definition of conditional expectation and Jensen's inequality 
(applied to the probability measure ndx on l^^, ^)) we obtain 

2 



ra-2 



n-2 



< 



n-2 

J2 lJn{f{x+l/n)-f{x)fdx 



/t=0 



By Lebesgue's differentiation theorem 
convergence 

n{f{x + l/n)-f{x)) 



X{[x,x + l/n)) 

to f'{x) for Lebesgue-almost all a; G /. On the other hand 

1 



n I {f{x + l/n)~f{x))dx 

n 

i-i 

+ l/n) - f{x)fdx. 
(7.2) Theorem, p. 100) we have 

x+l/n 

f{u)du 



|n(/(x + l/n) - f{x))\ < 2S*{x) := 2sup 

V \v\ 



\f\u)\du, 



where V is any open sub- interval V <Z I such that x & V . The function S* is 
not integrable, except for / = almost surely (see [20], p. 105). But by the 
Lemma of Hardy-Littlewood ([201, (7.9) Theorem, p. 105) there is a constant 
c > 0, such that 

A({x G t/ : S*{x) >a})<- [ \f'iu)\du < oo. 
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since L'^{U) C L^iU). That implies 
X{{x eU : \n{f{x + l/n) ~ f{x))\> a}) < X{{x e U : 2S*{x) > a}) 

< - [ \nu)\du 



« Ju 

and the sequence of difference functions n{f{x + 1/n) — f{x)) is therefore 
uniformly integrable. Thus, we may interchange hmit and integration which 
imphes the statement. ■ 



6.3 Convergence of the Smoothness Penalty 

As a consequence of the considerations in the preceding subsection, we prove 
now Proposition the corresponding F-convergence resuh for the Blake - 
Zisserman penalty ^^^f^^u- Note that the case /i = requires some care. 

Proof: (i) Let / = F + t G SBValt/). From the sequence /n,neN? fn ^ %%{U) 
we construct the decomposition fn-=fn — fn where 

n-l 

K=0 

We thus have /n = /,f + fn S'^d f^-^thj boundedness of t, hence as well 
f^ — >• F. Furthermore fni'^) ~ fni.^) = if the interval (^, -^j contains no 
jump of the step function t and there are only finitely many intervals that 
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contain a jump, namely at most j{t). Hence 



n-2 



J]min{^(/„(^)-/„(^))^74 

J2 min{^(/„(^)-/„(^))^7n} 

{K:{^,^]np(t)=0} 
+ Yl mM^(/n(-±^)-/n(-)f,7n} 

E min{-^(/„^(^)-/„^(s))^7n} 

{K:(^,^]np(t)=0} 

+ ^^iMfni^) - /n (f) + fni^) " /n(S))^7n} 

{^:(^,^]np(t)^0} 

Now we consider the sequence of conditional expectations := E{f — t\ an) 
Then := E{F \ an) and we have by Lemma ITU 



limsup J2 min{-^(/^(i±i) -/^(^))2,7,} 

""""^ {K:{ii,i£±i]np(i)=0} 

n-2 „i 



< lim5^^(/^(^)-/„^(f))^ = /i;M |/'(x)pda;. 

K = "^^ 

For the analysis of the exceptional intervals, we use the fact that F is abso- 
lutely continuous (see ^20j, p. 115). Therefore, for all e > there is an uq, 
such that for all n >no and k = 0, n — 1 we have 

sup \F{x) - F{x')\ < e. 

By the contraction property of conditional expectation that implies for all 
n > no, K = 0, 77, — 1 that |/^(^) — fni^)\ < 2e. Choose now no so large 
that e < 5/4 where 6 := minj,.gj(() \t{x) — t{x~)\ is the heigt of the smallest 
jump of the step function and additionally, such that all exceptional inter- 
vals contain exactly one discontinuity of t. That implies for the exceptional 
intervals 

l/n (-^) - /n^(f ) + ' /^(f )l > ^4- 
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Thus for all n with n > no and n5^/16/i^ > 7„ we have 

{K:{^,i^]np(t)^0} 

= mm{n6^/lQfil,-fn} = 1nj{t). 

(ii) Let / e L\U), fn G %{U) with /„ /. Then 

re-2 

5]min{^(/„(^)-/„(f))^7n} 

where a„ := {k < n - 2 : min{^(^„(^) - ^„(t))^7} = 7} and < : = 
{0, n — 2} — a„. Without loss of generality, we consider the subsequence 
of /„ with 

n-2 

liminf Emin{^(/„(^) - /„(f|))^7n} 

k;=0 
n-2 

= hm Y.'^ini^iffi^) - /r(f))^7n} 

K=0 

with corresponding exceptional sets a™^. The sets p™^ := {^/n : k G a™^} U 
{0, 1} are finite and hence closed in [0, 1]. Passing to another subsequence 
of f^^ if necessary, the compactness of the set of closed subsets of [0, 1] 
with respect to Hausdorff distance implies, that the sequence of sets p™^ 
converges to a closed subset c C [0,1]. By (i), convergence to a function 
/ G SBV2(f/) implies by the absolute continuity of / off the jump points 
that the exceptional set contains only finitely many points. Thus |c| = cxo 
implies / ^ SBV2(t/). 

Therefore assume |c| = < oo. Hence c G V{U). In that case. Lemma IT^ 
yields the following alternative: Either / G SBV2(t/), then / G H^{Q) for 
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all G i(c) and the limit of the subsequence is greater or equal to 

^K + ^^-' J2 Ifel'dx 
eet(c) 

where = /'le, or / ^ SBV2(t/) which implies that the limit is oo. That 
implies the statement, in particular for fi = 0. ■ 



7 Dependence on the Parameters 

In the final section, we will prove the Theorem ^ According to Theorem 
El the proof follows from F-convergence together with the fact - already 
established in Section EJ- that the set of minimizers is compact. Thus, we 
start by showing that the functionals in question depend continuously on the 
respective parameters in an appropriate sense. 

7.1 F-Continuity of the Segmentation Family 

We consider the three-dimensional (pseudo-) cube Q given by 

Q := X R+ X T 

and the corresponding family of functionals F{q) : L^{U) — > M, g G Q defined 
in Theorem H The statement about F-continuity on the cube now reads as 
follows: 

Theorem 6 Let Qs^sem with := {'^s, fJ'sjtg) E Q a sequence of parameters 
converging to q := (7,/i,t) G Q. Then 

F{qs) ^ F{q) 

as s tends to infinity. 

To prove this, the crucial point is the statement about F-convergence 
of the penalizers established in Proposition El However, we will need two 
additional lemmas, the first of which states that the discrete L^-distance used 
in the Blake - Zisserman functional converges to the continuous L^-distance. 
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Lemma 15 Let f,g& L'^iJJ), fn,nm ^i^h fn G %i{U) be a sequence of step 
functions converging to f in L'^{U) and gn,neN with gn := E{g \ cr„) the se- 
quence of conditional expectations. Then 



^ n—l 

lim-J2{fn-gnm) = \\f-g\ 



K=0 



Proof: First of all, E{fn \ Cn) = /n by a standard property of conditional 
expectation. Furthermore, martingale convergence (see e.g. ^Tj, Ch. 12) 
implies 

lim - / I or„) = 0. 



Hence 

n-l 



lim -^(/n = lim / E{fn - gWnfdx 

K=0 

= lim / {E{f^-fK) + E{f-g\cr„)Ydx= lim [ E{f - g\anfdx 

Again by martingale convergence, E{f — g \an) tends to f — g. m 

By the second lemma, we prove a result about approximation of L^- 
functions under additional constraints about the location of the jumps of the 
approximating step functions. 

Lemma 16 Let f G L'^{U) and km a sequence of non negative integers such 
that km < rn and km ^ oo. Then there is a sequence of functions 

fmEife TmiU) : < km} 

such that fm^fin 

Proof: We consider three families associated to /: 

(i) Fk := argmin^g^([;)j(^)<j^,||t — /||, k = 1,2,... - the minimum is not 
necessarily unique. 
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(ii) Fk,m ■= E{Fk I (Tm) - we have j{Fk,m) < 2A;, k,m = l, 2, .... 

(iii) fk,m ■= argmmig7-^(f;)^j.(j)<fc||t-/||, /c, m = 1, 2, ... - again, the minimum 
is not necessarily unique. 

By km oo, we have for all i^T > some Mq > such that km > 2K for all 
m > Mq. Then 

\\fk^,m - /II < \\FK,m - /II < \\FK,m " Fk\\ + \\Fk " /||. 

By construction 

lim \\FK,m -Fk\\ = 0, lim \\Fk - f\\ = 

m— >oo X^co 

and thus 

limsup||/fc„,„-/|| < ||F;,-/|| 

m^oo 

arbitrarily small and non negative. That implies the statement. ■ 

Now we can prove the theorem stated above. 

Proof: For the proof, we have to consider three different cases depending on 
the location of q. 

1st case. 7, /i > 0, t > 0: In that case — g if and only if 7^ — 7, /i^ — > /i 
and ts = t := 1/n for s > sq. That implies for s > Sq 

Fi^„fXs,ts){f)-Fi^,fi,t)if) 

r E::omin{-^(/(^)-/(^))^7.} fe%,{U) 

= \ -min{^(/(V)-/(-))^7} 

t else 

For step functions / G Tn{U), \\f\\L^{u) < C implies sup |/| < C-Jn. Thus 
Fils, f^s,ts) converges to F{j,fj,,t) uniformly on balls {||/||l2(;7) < C}. That 
implies F-convergence of the functionals. 

2nd case. 7 > 0, /i > 0, t = 0: In that case, F-convergence follows from 
Proposition El for the penalizer and Lemma IT^ for the distance term. 
3rd case. 7 = 0, /i > 0, t = 0: In that case we have Us := 1/ts —>■ 00 and 
ior fseX^U) 

d:;Us) < Bz;v.,,(/.) < risis + d-;Us). 
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Let now / G L'^iU) and fs G Tn^{U) a sequence of functions with — / in 
L2(f/). Then 

hminf BZ!;;_^^_^(/,) > hminf = hm = d,{f). 

That is condition (jHJ for F— convergence. Let now fs G Tn^U) a sequence of 
step functions such that each fs is some best approximation of / in the set 
Ms := {t G %{U) : j{t) < I/7,}. Then 7,j(/s) ^ 0, /, / and by Lemma 
Uni this sequence fulfills condition Q. ■ 



7.2 Proof of the Main Theorem 

The proof of Theorem ^ is now finally a consequence of Theorem to- 
gether with the compactness of the set of minimizers: Existence (i) of the 
minimizers is provided by Theorem |3 Convergence (iia) of the minimiz- 
ers follows from Theorem (i) together with Theorem IHl the result on F- 
convergence established above. For all sequences = {is, fJ^s,ts) converging 
to g = (7,/i, t) G Q, there is some b > with fig < b for all s. Since, by 
the reduction principle, all possible minimizers are thus contained in the set 
■Mb{g) (see Sectional), the existence of a convergent subsequence (iib) follows 
from the compactness result Theorem El 
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